Date: Tue, 05 Nov 1996 21:58:40 GMT
Server: NCSA/1.5
Content-type: text/html
Last-modified: Fri, 25 Oct 1996 18:21:39 GMT
Content-length: 2170

<html>
<head>
<title>DJF's Reinforcement Learning Page</title>
</head>

<body>

<h1> Some Reinforcement Learning Resources</h1>
<p>
<h2> <!WA0><!WA0><!WA0><!WA0><!WA0><a href="http://www.cs.wisc.edu/~finton/djfpubs.html">
      My publications</a></h2>
<p>
<h2> Short subjects:</h2>
<ul>
<li> <!WA1><!WA1><!WA1><!WA1><!WA1><a href="http://www.cs.wisc.edu/~finton/what-rl.html">
     What is reinforcement learning, and why is it hard?</a>
<li> <!WA2><!WA2><!WA2><!WA2><!WA2><a href="http://www.cs.wisc.edu/~finton/ibfe.html">
     What is Importance-Based Feature Extraction?</a>
</ul>

<p>
<h2> Simulation code for several control problems:</h2>
<ul>
<li> <!WA3><!WA3><!WA3><!WA3><!WA3><a href="http://www.cs.wisc.edu/~finton/poledriver.html">
     Pole-cart problem, driver module</a>---Just the driver;  supply your
     own controller
<li> <!WA4><!WA4><!WA4><!WA4><!WA4><a href="http://www.cs.wisc.edu/~finton/qcontroller.html">
     Sample Q-learning controller module</a>---Suitable for use with the
     pole-cart driver module.  (Currently, doesn't use probabilistic action 
     selection).
<li> <!WA5><!WA5><!WA5><!WA5><!WA5><a href="ftp://ftp.cs.umass.edu/pub/anw/pub/sutton/pole.c">
     Barto-Sutton-Anderson pole-cart solution</a>
<li> <!WA6><!WA6><!WA6><!WA6><!WA6><a href="http://www.cs.colostate.edu/~anderson/#software">
     Chuck Anderson's public domain code for neural networks and 
     reinforcement learning</a>
<li> <em>Suggestions on additional links?</em>
</ul>

<p>
<h2> Other RL resources:</h2>
<ul>
<li> <!WA7><!WA7><!WA7><!WA7><!WA7><a href="http://envy.cs.umass.edu/People/sutton/RLinterface/RLinterface.html">
     Proposed Standard for Reinforcement Learning Software</a>, by
     Rich Sutton and Juan Carlos Santamaria
<li> <!WA8><!WA8><!WA8><!WA8><!WA8><a href="ftp://archive.cis.ohio-state.edu/pub/neuroprose/">
     NeuroProse Archive (Ohio State University)</a>
<li> <!WA9><!WA9><!WA9><!WA9><!WA9><a href="ftp://ftp.gmd.de/Learning/rl/">
     GMD Reinforcement Learning Archive</a>
<li> <!WA10><!WA10><!WA10><!WA10><!WA10><a href="http://envy.cs.umass.edu/People/sutton/sutton.html">
     Rich Sutton's</a> home page and RL archive
<li> <!WA11><!WA11><!WA11><!WA11><!WA11><a href="http://www.idiap.ch/html/idiap-networks.html">
     IDIAP Neural Network Home Page, including links to conferences</a>
</ul>

<p>

<hr>

<!WA12><!WA12><!WA12><!WA12><!WA12><a href="http://www.cs.wisc.edu/~finton/finton.html">
David J. Finton</a>, <!WA13><!WA13><!WA13><!WA13><!WA13><a href="mailto:finton@cs.wisc.edu">
<em>finton@cs.wisc.edu</em></a>, October 25, 1996.


</body>
</html>

